Oracle ASM: Monitoring and Managing (Part 2)

Comments 0

Share to social media

In the first part of this series, we explored the fundamentals of Oracle Automatic Storage Management (ASM)—a powerful volume manager and file system integrated with Oracle Database. ASM simplifies storage by abstracting physical disks into disk groups and automating tasks like striping, mirroring, and rebalancing. This automation helps ensure high availability, scalability, and optimal performance for Oracle workloads.

But once ASM is up and running, how do you ensure it stays healthy and efficient? That’s where monitoring and proactive maintenance come into play.

This article outlines the key areas to monitor in Oracle ASM, why they matter, and what SQL or tools to use for visibility and alerts.

Disk Group Usage

Why it matters: Disk groups are the heart of ASM. If they fill up, database writes can fail, and performance may degrade. Although rare in modern and far less limited storage solutions of today, it’s still important to monitor the free space of any diskgroup.

What to monitor:

Recommendations:

  • Set alerts when usable_file_mb drops below 15–20% of the total disk group size. In larger storage environments, it often makes sense to calculate thresholds by MB free vs. percentage free.
  • ASM may report enough free space, but due to mirroring and striping, usable_file_mb is a more accurate indicator of true usable capacity.

Disk Health and Status

Why it matters: A single failed or degraded disk can compromise redundancy, performance, and rebalance operations. Having a clear understanding of the status and availability of disk groups is essential.

What to check:

Red flags:

  • MOUNT_STATUS = CLOSED or UNKNOWN
  • HEADER_STATUS = FORMER, MISSING, or CANDIDATE (unexpectedly)
  • STATE = OFFLINE or UNKNOWN

Rebalance Operations

Why it matters: ASM automatically rebalances data when disks are added or removed. Frequent or slow rebalances may indicate configuration or I/O bottlenecks. The estimates showing minutes left, as well as the amount of work that’s been completed can assist the DBA in knowing the clear status of a rebalance operation.

What to monitor:

Tips:

  • Track how often rebalances occur.
  • Monitor if operations remain in an active or long-running state.
  • Use POWER level wisely. Although higher values are available in post 11g limits, higher values mean higher IO and CPU to complete faster.

Redundancy and Mirroring Levels

Why it matters: ASM supports different redundancy levels (EXTERNAL, NORMAL, HIGH). Choosing the wrong one for your workload can risk data availability. Mission critical workloads require higher redundancy levels, where development environments can use lower ones.

What to check:

Best practices:

  • Match redundancy levels to workload importance:
  • HIGH: Mission-critical production systems
  • NORMAL: General production or staging
  • EXTERNAL: When using hardware RAID
  • Automate checks to verify that templates align with the expected configuration.

File Access and I/O Statistics

Why it matters: ASM handles all Oracle files—datafiles, redo logs, control files—and poor I/O can severely impact database performance.

Files Overview:

I/O Stats:

Insights:

  • Join these views with V$ performance views for comprehensive performance analysis.
  • Track error counts and abnormal read/write ratios.

ASM Alert Logs and Trace Files

Why it matters: Not all issues appear in SQL views. ASM writes key warnings and errors to its alert logs. Any DBA will know for detailed information on issues or errors, the alert log is the place to go. For ASM, there is a handy tool to provide filtered views of errors.

How to check logs:

Look for:

  • ORA- errors
  • Rebalance messages
  • I/O errors
  • Disk offline or fail events

Tip: Set up log monitoring tools or scripts to scan for critical messages in real time.

ASM and Cluster Instance Availability (RAC Environments)

Why it matters: In Oracle RAC environments, each node runs an ASM instance. ASM unavailability on any node can affect database operations. As RAC has shared storage, it’s essential to monitor the ASM instance on each RAC node. For RAC, as for each database node, the ASM instance must have a unique name for the shared storage. It’s common to number the ASM instance, (i.e. +ASM1, +ASM2…)

What to monitor:

Ensure:

  • All ASM instances are online and registered with Oracle Clusterware.
  • No unexpected restarts or failovers have occurred.

Compatibility and Feature Settings

Why it matters: ASM compatibility levels control access to advanced features like ASM Cluster File System (ACFS), Flex ASM, and more.

Check settings:

Validate:

  • Compatibility levels align with Oracle version and features in use.
  • Features like ADVM and Flex ASM are configured only where supported.

Bonus Tips for Proactive Monitoring

  • Set up custom scripts to check space usage and disk health daily.
  • Leverage Oracle ASMCMD for file-system-like navigation and troubleshooting.
  • Monitor ASM patch levels to ensure they are in line with Oracle’s recommendations.
  • Track fragmentation and rebalance trends to catch early signs of performance issues.

Conclusion

Oracle ASM automates many storage management tasks, but like any critical infrastructure, it requires careful monitoring to ensure reliability and performance. By keeping a close eye on disk group usage, disk health, I/O metrics, and rebalance activity and setting up proactive alerts, you can prevent issues before they impact your database.

Oracle can have many architecture solutions, no matter if it’s a single instance, multi-tenant or Oracle RAC, best practices will help you unlock the full power of ASM while maintaining the high availability and performance your Oracle environment requires.

Article tags

Load comments

About the author

Kellyn Pot'Vin-Gorman

DBAKevlar

See Profile

Kellyn Gorman is the multi-platform database and AI advocate at Redgate. She's been in the tech industry for a quarter of a century, specializing in Oracle, SQL Server, MySQL and PostgreSQL. Her focus on Azure and Google Cloud for high IO workloads on IaaS has been of exceptional interest for data-infra specialists in the tech world. Her content is highly respected under her handle DBAKevlar. She is co-leader of the Data Platform DEI group, an executive board core member for DZone, and mentors around half a dozen people at any given time in multiple communities.